Qual a série mais bem avaliada? A diferença entre as notas IMDB de cada séries é muito grande?

#Pensar se a média é realmente representativa
series_a_serem_analisadas = read_csv(here("data/series_from_imdb.csv"),
                                     progress = FALSE) %>%
                            filter(series_name %in% c("Mad Men", "Sherlock", "The Killing"))
Parsed with column specification:
cols(
  series_name = col_character(),
  episode = col_character(),
  series_ep = col_integer(),
  season = col_integer(),
  season_ep = col_integer(),
  url = col_character(),
  user_rating = col_double(),
  user_votes = col_double(),
  r1 = col_double(),
  r2 = col_double(),
  r3 = col_double(),
  r4 = col_double(),
  r5 = col_double(),
  r6 = col_double(),
  r7 = col_double(),
  r8 = col_double(),
  r9 = col_double(),
  r10 = col_double()
)
medias_imd_por_serie = group_by(series_a_serem_analisadas, series_name) %>%
                       summarize(media_imdb = mean(user_rating))

Calculamos a nota IMDB de cada série fazendo uma média das notas, dadas pelos espectadores, de cada episódio. Essa nota, por sua vez, é calculada fazendo-se uma média ponderada das notas, variando de 1 a 10, e a quantidade de pessoas que votaram. Portanto, podemos suspeitar que nossa nota IMDB é representativa. Dito isto, temos que, dentre as séries escolhidas, a maior nota é a de Sherlock, aproximadamente 8.9, porém as outras não não estão muito longe disso.

medias_series = plot_ly(medias_imd_por_serie,
                        x = ~series_name,
                        y = ~media_imdb,
                        name = "Média IMDB Séries",
                        type = "bar",
                        color = ~series_name) %>%
                        layout(yaxis = list(title = "Média IMDB"),
                               xaxis = list(title = "Séries"),
                               barmode = "group")
medias_series

No entanto, podemos ver que a The Killing é a que possui uma distribuição de notas mais homogênea, enquanto que a dispersão das notas dos episódios de Mad Men e Sherlock são maiores. Sendo Mad Men a que tem uma maior distância entre a menor e maior nota atribuida. Além disso, podemos perceber que a mediana e a média de cada série estão próximas uma da outra. Confirmando que a média é representativa.

variacoes_notas = plot_ly(series_a_serem_analisadas,
                          x = ~series_name,
                          y = ~user_rating,
                          type = "box",
                          color = ~series_name) %>%
                          layout(yaxis = list(title = "Média IMDB"),
                                 xaxis = list(title = "Série"))
variacoes_notas

Mas será que as avaliações das séries mudam muito de acordo com a temporada?

No gráfico abaixo, podemos observar dois casos interessantes. O público parece não ter gostado muito da última temporada de Sherlock, pois a avaliação da quarta temporada caiu 0.625 em relação a terceira, e é a nota mais baixa atribuída à série. Já The Killing, por mais estranho que pareça, principalmente para quem viu a nota da série no Rotten Tomatoes, parece agradar cada vez mais ao público, mostrando um gráfico sempre crescente.

media_por_temporada = aggregate(series_a_serem_analisadas$user_rating,
                                by = list(series_name = series_a_serem_analisadas$series_name,
                                          season = series_a_serem_analisadas$season),
                                mean)

colnames(media_por_temporada)[3] <- "season_mean"
media_temporada = plot_ly(media_por_temporada,
                          x = ~season,
                          y = ~season_mean,
                          color = ~series_name,
                          type = "scatter",
                          mode = "lines") %>%
                  layout(yaxis = list(title = "IMDB da Temporada"),
                         xaxis = list(title = "Temporada"))

media_temporada
LS0tCnRpdGxlOiAiTWFkIE1lbiwgU2hlcmxvY2sgZSBUaGUgS2lsbGluZyBzZWd1bmRvIG8gSU1EQiIKYXV0aG9yOiAiQ2xhcmEgTW9yYWVzIERhbnRhcyIKZGF0ZTogIjIxIGRlIEFicmlsIGRlIDIwMTgiCm91dHB1dDoKICBodG1sX25vdGVib29rOgogICAgdG9jOiB5ZXMKICAgIHRvY19mbG9hdDogeWVzCiAgaHRtbF9kb2N1bWVudDoKICAgIGRmX3ByaW50OiBwYWdlZAogICAgdG9jOiB5ZXMKICAgIHRvY19mbG9hdDogeWVzCi0tLQoKYGBge3Igc2V0dXAsIGVjaG89RkFMU0UsIHdhcm5pbmc9RkFMU0UsIG1lc3NhZ2U9RkFMU0V9CmxpYnJhcnkodGlkeXZlcnNlKQpsaWJyYXJ5KGhlcmUpCmxpYnJhcnkoZ2dwbG90MikKbGlicmFyeShwbG90bHkpCmBgYAojIyNRdWFsIGEgc8OpcmllIG1haXMgYmVtIGF2YWxpYWRhPyBBIGRpZmVyZW7Dp2EgZW50cmUgYXMgbm90YXMgSU1EQiBkZSBjYWRhIHPDqXJpZXMgw6kgbXVpdG8gZ3JhbmRlPwoKYGBge3J9CiNQZW5zYXIgc2UgYSBtw6lkaWEgw6kgcmVhbG1lbnRlIHJlcHJlc2VudGF0aXZhCnNlcmllc19hX3NlcmVtX2FuYWxpc2FkYXMgPSByZWFkX2NzdihoZXJlKCJkYXRhL3Nlcmllc19mcm9tX2ltZGIuY3N2IiksCiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICBwcm9ncmVzcyA9IEZBTFNFKSAlPiUKICAgICAgICAgICAgICAgICAgICAgICAgICAgIGZpbHRlcihzZXJpZXNfbmFtZSAlaW4lIGMoIk1hZCBNZW4iLCAiU2hlcmxvY2siLCAiVGhlIEtpbGxpbmciKSkKCm1lZGlhc19pbWRfcG9yX3NlcmllID0gZ3JvdXBfYnkoc2VyaWVzX2Ffc2VyZW1fYW5hbGlzYWRhcywgc2VyaWVzX25hbWUpICU+JQogICAgICAgICAgICAgICAgICAgICAgIHN1bW1hcml6ZShtZWRpYV9pbWRiID0gbWVhbih1c2VyX3JhdGluZykpCmBgYAoKQ2FsY3VsYW1vcyBhIG5vdGEgSU1EQiBkZSBjYWRhIHPDqXJpZSBmYXplbmRvIHVtYSBtw6lkaWEgZGFzIG5vdGFzLCBkYWRhcyBwZWxvcyBlc3BlY3RhZG9yZXMsIGRlIGNhZGEgZXBpc8OzZGlvLiBFc3NhIG5vdGEsIHBvciBzdWEgdmV6LCDDqSBjYWxjdWxhZGEgZmF6ZW5kby1zZSB1bWEgbcOpZGlhIHBvbmRlcmFkYSBkYXMgbm90YXMsIHZhcmlhbmRvIGRlIDEgYSAxMCwgZSBhIHF1YW50aWRhZGUgZGUgcGVzc29hcyBxdWUgdm90YXJhbS4gUG9ydGFudG8sIHBvZGVtb3Mgc3VzcGVpdGFyIHF1ZSBub3NzYSBub3RhIElNREIgw6kgcmVwcmVzZW50YXRpdmEuIERpdG8gaXN0bywgdGVtb3MgcXVlLCBkZW50cmUgYXMgc8OpcmllcyBlc2NvbGhpZGFzLCBhIG1haW9yIG5vdGEgw6kgYSBkZSBTaGVybG9jaywgYXByb3hpbWFkYW1lbnRlIDguOSwgcG9yw6ltIGFzIG91dHJhcyBuw6NvIG7Do28gZXN0w6NvIG11aXRvIGxvbmdlIGRpc3NvLgoKYGBge3J9Cm1lZGlhc19zZXJpZXMgPSBwbG90X2x5KG1lZGlhc19pbWRfcG9yX3NlcmllLAogICAgICAgICAgICAgICAgICAgICAgICB4ID0gfnNlcmllc19uYW1lLAogICAgICAgICAgICAgICAgICAgICAgICB5ID0gfm1lZGlhX2ltZGIsCiAgICAgICAgICAgICAgICAgICAgICAgIG5hbWUgPSAiTcOpZGlhIElNREIgU8OpcmllcyIsCiAgICAgICAgICAgICAgICAgICAgICAgIHR5cGUgPSAiYmFyIiwKICAgICAgICAgICAgICAgICAgICAgICAgY29sb3IgPSB+c2VyaWVzX25hbWUpICU+JQogICAgICAgICAgICAgICAgICAgICAgICBsYXlvdXQoeWF4aXMgPSBsaXN0KHRpdGxlID0gIk3DqWRpYSBJTURCIiksCiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB4YXhpcyA9IGxpc3QodGl0bGUgPSAiU8OpcmllcyIpLAogICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgYmFybW9kZSA9ICJncm91cCIpCm1lZGlhc19zZXJpZXMKYGBgCgpObyBlbnRhbnRvLCBwb2RlbW9zIHZlciBxdWUgYSBUaGUgS2lsbGluZyDDqSBhIHF1ZSBwb3NzdWkgdW1hIGRpc3RyaWJ1acOnw6NvIGRlIG5vdGFzIG1haXMgaG9tb2fDqm5lYSwgZW5xdWFudG8gcXVlIGEgZGlzcGVyc8OjbyBkYXMgbm90YXMgZG9zIGVwaXPDs2Rpb3MgZGUgTWFkIE1lbiBlIFNoZXJsb2NrIHPDo28gbWFpb3Jlcy4gU2VuZG8gTWFkIE1lbiBhIHF1ZSB0ZW0gdW1hIG1haW9yIGRpc3TDom5jaWEgZW50cmUgYSBtZW5vciBlIG1haW9yIG5vdGEgYXRyaWJ1aWRhLiBBbMOpbSBkaXNzbywgcG9kZW1vcyBwZXJjZWJlciBxdWUgYSBtZWRpYW5hIGUgYSBtw6lkaWEgZGUgY2FkYSBzw6lyaWUgZXN0w6NvIHByw7N4aW1hcyB1bWEgZGEgb3V0cmEuIENvbmZpcm1hbmRvIHF1ZSBhIG3DqWRpYSDDqSByZXByZXNlbnRhdGl2YS4KCmBgYHtyfQp2YXJpYWNvZXNfbm90YXMgPSBwbG90X2x5KHNlcmllc19hX3NlcmVtX2FuYWxpc2FkYXMsCiAgICAgICAgICAgICAgICAgICAgICAgICAgeCA9IH5zZXJpZXNfbmFtZSwKICAgICAgICAgICAgICAgICAgICAgICAgICB5ID0gfnVzZXJfcmF0aW5nLAogICAgICAgICAgICAgICAgICAgICAgICAgIHR5cGUgPSAiYm94IiwKICAgICAgICAgICAgICAgICAgICAgICAgICBjb2xvciA9IH5zZXJpZXNfbmFtZSkgJT4lCiAgICAgICAgICAgICAgICAgICAgICAgICAgbGF5b3V0KHlheGlzID0gbGlzdCh0aXRsZSA9ICJNw6lkaWEgSU1EQiIpLAogICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICB4YXhpcyA9IGxpc3QodGl0bGUgPSAiU8OpcmllIikpCnZhcmlhY29lc19ub3RhcwpgYGAKCiMjI01hcyBzZXLDoSBxdWUgYXMgYXZhbGlhw6fDtWVzIGRhcyBzw6lyaWVzIG11ZGFtIG11aXRvIGRlIGFjb3JkbyBjb20gYSB0ZW1wb3JhZGE/CgpObyBncsOhZmljbyBhYmFpeG8sIHBvZGVtb3Mgb2JzZXJ2YXIgZG9pcyBjYXNvcyBpbnRlcmVzc2FudGVzLiBPIHDDumJsaWNvIHBhcmVjZSBuw6NvIHRlciBnb3N0YWRvIG11aXRvIGRhIMO6bHRpbWEgdGVtcG9yYWRhIGRlIFNoZXJsb2NrLCBwb2lzIGEgYXZhbGlhw6fDo28gZGEgcXVhcnRhIHRlbXBvcmFkYSBjYWl1IDAuNjI1IGVtIHJlbGHDp8OjbyBhIHRlcmNlaXJhLCBlIMOpIGEgbm90YSBtYWlzIGJhaXhhIGF0cmlidcOtZGEgw6Agc8OpcmllLiBKw6EgVGhlIEtpbGxpbmcsIHBvciBtYWlzIGVzdHJhbmhvIHF1ZSBwYXJlw6dhLCBwcmluY2lwYWxtZW50ZSBwYXJhIHF1ZW0gdml1IGEgbm90YSBkYSBzw6lyaWUgbm8gUm90dGVuIFRvbWF0b2VzLCBwYXJlY2UgYWdyYWRhciBjYWRhIHZleiBtYWlzIGFvIHDDumJsaWNvLCBtb3N0cmFuZG8gdW0gZ3LDoWZpY28gc2VtcHJlIGNyZXNjZW50ZS4KCmBgYHtyfQptZWRpYV9wb3JfdGVtcG9yYWRhID0gYWdncmVnYXRlKHNlcmllc19hX3NlcmVtX2FuYWxpc2FkYXMkdXNlcl9yYXRpbmcsCiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgYnkgPSBsaXN0KHNlcmllc19uYW1lID0gc2VyaWVzX2Ffc2VyZW1fYW5hbGlzYWRhcyRzZXJpZXNfbmFtZSwKICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgc2Vhc29uID0gc2VyaWVzX2Ffc2VyZW1fYW5hbGlzYWRhcyRzZWFzb24pLAogICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIG1lYW4pCgpjb2xuYW1lcyhtZWRpYV9wb3JfdGVtcG9yYWRhKVszXSA8LSAic2Vhc29uX21lYW4iCmBgYAoKYGBge3J9Cm1lZGlhX3RlbXBvcmFkYSA9IHBsb3RfbHkobWVkaWFfcG9yX3RlbXBvcmFkYSwKICAgICAgICAgICAgICAgICAgICAgICAgICB4ID0gfnNlYXNvbiwKICAgICAgICAgICAgICAgICAgICAgICAgICB5ID0gfnNlYXNvbl9tZWFuLAogICAgICAgICAgICAgICAgICAgICAgICAgIGNvbG9yID0gfnNlcmllc19uYW1lLAogICAgICAgICAgICAgICAgICAgICAgICAgIHR5cGUgPSAic2NhdHRlciIsCiAgICAgICAgICAgICAgICAgICAgICAgICAgbW9kZSA9ICJsaW5lcyIpICU+JQogICAgICAgICAgICAgICAgICBsYXlvdXQoeWF4aXMgPSBsaXN0KHRpdGxlID0gIklNREIgZGEgVGVtcG9yYWRhIiksCiAgICAgICAgICAgICAgICAgICAgICAgICB4YXhpcyA9IGxpc3QodGl0bGUgPSAiVGVtcG9yYWRhIikpCgptZWRpYV90ZW1wb3JhZGEKYGBgCgo=